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1.  Introduction 


In  this  report  we  analyze  the  probability  distribution  of  the  length  of 
a  failure- free  reliability  test  and  the  conditional  reliability  r(t,  At)  of 
components  undergoing  the  reliability  test.  The  conditional  reliability  of 
a  component  we  define  as  the  probability  of  a  component  not  failing  for  At 
time  units  in  the  future,  given  that  the  component  is  operating  at  time  t, 
r (t,  jt)  •  ?  [  T  >  t  +  ^  |  T  >  t  ]  (1) 

We  assume  a  two-parameter  Weibull  life-length  distribution  with  scale  para¬ 
meter  $  and  shape  parameter  g.  When  g  *  1,  the  Weibull  coincides  with  the 
exponential  distribution,  so  that  (1)  reduces  to  the  reliability  at  time  &t, 
r(t,  ^t)  =  R  (At)  =  P  [  T  >  At  ]  (2) 

Although  the  results  contained  herein  are  valid  for  any  values  of  g  >  0,  we 
restrict  ourselves  to  a  reliability  growth  situation  in  which  g  <  1. 

We  use  these  ideas  to  compute  the  expected  length  of  a  failure-free 
test  and  contrast  it  with  a  fixed-length  test  whose  length  is  equal  to  the 
expected  length  of  the  failure- free  test.  We  can  then  examine  the  condi¬ 
tional  reliability  of  a  component  at  the  end  of  the  original  burn-in  time. 
The  results  of  this  report  can  be  used  by  the  test  engineer  to  answer  the 
following  questions: 

(a)  How  much  longer  than  the  original  burn-in  time  can  a  failure- free 
test  be  expected  to  run? 

(b)  How  many  more  failures  can  be  expected  in  a  fixed-time  test  with 
the  same  length  as  the  failure-free  test? 

Based  on  the  answers  to  these  questions,  the  test  engineer  can  decide  which 
reliability  test,  fixed-length  or  failure- free,  would  be  more  useful.  For 
ease  of  application,  we  provide  some  charts  which  can  be  easily  used  to  de¬ 
termine  the  length  and  conditional  reliability  of  a  failure-free  test. 
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In  section  2  of  this  report  we  describe  in  detail  a  failure-free  relia¬ 
bility  test.  In  order  to  perform  the  analysis ,  we  must  malte  a  number  of 
assumptions  which  we  explicitly  state  in  section  3.  In  section  4  we  derive 
the  equation  of  the  probability  distribution  of  the  test  length,  and  in 
section  5  we  derive  the  conditional  reliability  equation.  In  section  6 
we  discuss  briefly  the  numerical  solutions  of  these  equations  and  the  es¬ 
timation  of  parameters.  In  section  7  we  describe  the  testing  procedure  and 
provide  some  numerical  examples.  Appendix  A  contains  charts  for  the  test 
length,  and  Appendix  B  contains  charts  for  the  conditional  reliability. 


2.  Failure-Free  Reliability  Tests 

A  failure-free  reliability  test  is  one  in  which  a  system  must  pass  a 
fixed  time,  t^,  free  of  failure  before  being  accepted.  If  a  failure  is  en¬ 
countered  before  time  t^,  the  system  is  repaired  and  a  new  burn-in  period  of 
length  tg  is  re-established.  This  reliability  testing  philosophy  has  merit 
in  those  situations  in  which  there  is  suspected  non-homogeneity  of  quality 
among  various  systems.  In  those  situations,  those  systems  that  are  1  likely  to 
fail1  will  receive  a  great  deal  of  testing  before  being  accepted.  On  the  other 
hand,  other  systems  will  be  accepted  with  the  original  burn-in  time  t^. 

This  type  of  reliability  testing  procedure  may  not  be  optimal  in  cer¬ 
tain  other  situations.  For  instance,  it  may  be  difficult  to  justify  an 
a  priori  assumption  of  non-homogeneity  of  quality  among  systems.  In  fact, 
the  systems  may  all  have  identical  time-to- failure  probability  distributions 
which  are  unchanged  following  repair.  However,  due  to  strictly  random  failure 
times,  some  components  may  appear  to  be  ’likely  to  fail*  and  thus  receive  an 
inordinate  amount  of  burn-in.  Other  components,  on  the  other  hand,  may  be 
accepted  after  time  t^  although  they  would  have  failed  shortly  after. 

A  simple  example  of  the  above  is  a  series  system  made  up  of  n  devices. 
Suppose  that  the  times  to  failure  of  all  of  the  devices  are  independent,  iden¬ 
tically  distributed  (iid) ,  with  an  exponential  life-length  distribution 

ft  (t)  -  \e'Xt,  i  -  1,  2 . n  (3) 

Since  this  series  system  will  fail  when  the  first  device  fails,  it  follows 
that  the  system's  life- length  distribution  is  that  of  the  smallest  order 
statistic,  which  is 

f(t)  •  n  \  e'1**,  (4) 

again  exponential.  Now,  if  the  system  fails  and  the  Inoperative  device  is 


replaced  by  one  with  an  Identical  life-length  distribution,  then  the  memory- 
less  property  of  the  exponential  guarantees  that  the  system  will  have  the 
same  life-length  distribution  (1) .  It  is  erroneous,  then  to  conclude  that 
this  system  requires  a  longer  burn-in  period  than  the  others,  since  in  fact 
they  ail  have  identical  life-length  distributions.  If  the  device  life-length 
distribution  is  Welbull,  then  the  smallest  order  statistic  still  has  a  Weibull 
distribution.  In  this  case,  however,  the  reasoning  is  not  as  direct,  since 
the  other  components  have  'aged1  and  therefore  have  a  different  life-length 
distribution  from  that  of  the  repaired  system,  lu  this  report  we  examine 
this  situation,  subject  to  certain  background  assumptions. 
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3.  Problem  Formulation 

The  model  which  we  consider  is  based  on  the  following  assumptions: 

(i)  The  life-lengths  of  the  system  are  iid  following  a  two-parameter 


Weibull  distribution: 

f(t)  =  P/6  (t/6)8'1  exp  [-(t/fif],  t  >  0,  6  >  0,  0  <  e  S  1  (5) 

The  restriction  that  g  <  1  is  not  strictly  necessary.  For  the 
purpose  of  this  report,  however,  we  restrict  ourselves  to  this 

case,  which  represents  a  reliability  growth  situation  (or:  de-  \ 

\ 

creasing  hazard  rate).  The  parameter  5  is  referred  to  as  the  j 

characteristic  life,  in  that  63.2%  of  the  components  on  the  j 

average  will  fail  before  5,  independent  of  the  value  of  g .  In 

later  sections,  we  will  re-parametrize  the  terms  of  f: 

V  *  t/6  (6) 

and  thus  interpret  y  as  a  fraction  of  the  characteristic  life. 

The  Mean  Time  Between  Failure  (MTBF)  for  such  a  system  is  given  by 
MTBF  -  6  T  (1  +  1/p)  (7) 

The  reliability  function,  R(t),  is  given  by 

R(t)  -  exp  [-(t/6)]e  (8) 

If  g  *  1,  this  distribution  reduces  to  an  exponential  with 
MTBF  •  $. 

(ii)  All  of  the  systems  are  simultaneously  burned  in  for  t^  time  units. 

If  a  system  fails  before  time  t^,  it  is  removed  from  test,  re¬ 
paired,  and  its  post-repair  condition  is  the  same  as  a  new  sys¬ 
tem.  That  is,  its  life-length  distribution  is  as  given  in  (5). 

We  further  assume  that  there  is  independence  between  the  life- 
lengths  before  and  after  repair.  This  would  not  be  the  case, 
for  example,  if  there  were  a  conmon  source  of  failure  which  went 
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undiagnosed.  Assuming  that  the  true  cause  of  failure  were  diagnosed  and 
corrected  each  time,  however,  independence  of  life  lengths  is  not  an  un¬ 
reasonable  assumption.  Since  it  takes  some  time  to  effect  the  repair,  a 
new  burn-in  cycle  for  the  repaired  system  is  begun  at  the  end  of  the  ori¬ 
ginal  cycle;  that  is  at  time  t^.  All  repaired  systems  are  then  burned  in 
from  time  tQ  until  time  2tQ,  with  the  process  being  repeated  for  those  sys¬ 
tems  which  fail  once  again.  A  system  is  not  accepted  until  it  passes  a 
time  period  of  length  t^  which  is  failure- free. 


4.  Length  of  a  Failure-Free  Test  (one  system) 

The  length,  L  of  a  failure- free  test  is  a  random  variable,  and  under 
the  assumptions  above, 

P  [L  »  ictg]  *  P  [Failure  during  first  It- 1  cycles  and  success  during 
the  lcth  cycle] 

By  independence  it  follows  that 

P  [L  *  IUq]  -  [1  -  R(t)] lt”1  R(t),  It  -  1,  2,  3,  ...  (9) 

which,  using  (8)  becomes 

P  [L  -  kt^  =  {1  -  exp  [-(t0/6)e]}k‘l  exp  [-(tQ/6)e]  (10) 

If  we  define  the  random  variable 

W  *  L/t0  (11) 

(where  W  =  number  of  burn-in  periods  required  to  'failure  free') 
then  W  follows  a  geometric  distribution. 

From  this  observation  it  follows  that 

E (W)  =  exp  (tQ/6)B  (12) 

and 

Var  (W)  -  exp  [2(tQ/6)B]  -  exp  (t^  /6)P  (13) 

Using  (11),  therefore,  we  obtain 

E(L)  -  tQ  exp  (tQ/6)P  (14) 

and 

Var  (L)  -  (exp  [2(tQ/6)e]  -  exp  (tQ/§)p} 

Of  more  usefulness  than  expected  value  of  test  length,  E(L),  is  the 
expected  proportional  excess  test  time  over  t^,  which  we  define  as 

Lx  -  (E(L)  -  tQ)/t0  (15) 

Using  (11),  we  obtain 

Lx  ■  exp  ■  1  (16) 
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5,  Probability  of  Failure 


The  time  inter/al 

I  -  (t0,  E(D)  (17) 

represents  the  expected  additional  time  that  a  failure-free  test  extends  over 
a  fixed-time  test  of  length  t^.  We  now  consider  the  system  reliability  over 
the  time  interval  I.  Specifically,  we  derive  the  probability  of  system  fail¬ 
ure  over  I.  If  this  probability  is  sufficiently  high,  then  rather  than  use 
a  failure- free  procedure,  the  test  engineer  would  want  to  extend  the  test  to 
time  E(L)  equally  for  all  systems.  In  either  case,  the  expected  total  test 
time  devoted  to  all  systems  is  the  same,  namely 

Total  Test  Time  -  n  E  (L)  (18) 

Therefore,  the  test  engineer  can  make  a  decision  at  time  between  two 
approaches  which  have  approximately  the  same  cost  (since  the  total  test  times 
are  equal,  and  assuming  the  test  cost  is  proportional  to  its  length),  but  one 
of  which  will  have  a  higher  pay-off  in  terms  of  defects  found. 

We  are  interested  in 

h(t0  ;  6,  e)  *  p  [Failure  in  I  |  T  >  tQ1  (19) 

Such  a  conditional  probability  can  be  interpreted  as  the  fraction  of 
those  systems  still  operating  at  time  t^  which  will  fail  during  time  inter¬ 
val  I.  The  conditional  reliability  will  then  be  given  by 


r(tQ,  E (L)  -  tQ)  -  1  -  h  (tQ;  g,  g).  (20) 

Since  h(tQ;  5,  g)  -  P  [Failure  in  I] /P  [T  >  (21) 

-  fR(t0)  -  R[E(L)]) /R(t0)  (22) 

-  1  -  R  [E(L)] /R(t0)  (23) 

we  have 

h(tQ;  5,  g)  -  1  -  exp  { <tQ/« )^  [1-exp  (0(tQ/6)®)]}  (24) 


(25) 


r(t0,  E(L)  -  tQ)  -  exp  {(tQ/6)P  [1  -  exp  (p  y  B)]} 

6.  Expected  Repeat  Failures 

The  expected  additional  number  of  repeat  failures  for  a  failure- free  test, 
RFF,  may  be  computed  using  the  developed  distributions  as 

RFF  =  n  [  exp  <tQ/6 )P -1  ]  -  n [1  -  exp  -  (tQ/«)B].  (26) 

These  two  terms  may  be  interpreted  as  the  total  expected  number  of  failures 
and  the  expected  number  of  failures  in  the  first  burn-in  period,  respectively. 
If  a  fixed-time  test  is  used,  and  failed  items  are  replaced,  then  the  expected 
additional  number  of  repeat  failures,  RFT,  may  be  computed  by 

RFT  *  r  [1  -  exp  {  -  (L^/S)*}]  (27) 

where  r  *  number  of  failures  in  time  t^. 


9 


6.  Numerical  Methods 


Included  with  this  report  is  a  set  of  charts  (Appendices  A,  B)  which  con¬ 
tain  solutions  of  equations  16  and  24  for  and  h  as  a  function  of  the  para¬ 
meters  g  and  y  -  t/$.  These  charts  were  produced  using  a  Newton-Raphson  al¬ 
gorithm,  solving  iteratively  for  Y  in  the  equations 


Y  - 
n 


a*  [  h(to; 


(28) 


Y 


n 


-  M-  ■  »  -  V 

»V  l,  «*.  *» 


(29) 


where  h*  and  L^*  are  constants.  Figures  1  through  5  are  solutions  of  h  and 
Figures  6  through  9  are  solutions  of  L^.  The  figures  differ  in  their  range 
of  values,  as  listed  in  Tables  I  and  II.  The  computations  were  performed  on 


the  Air  Force  Academy's  Burroughs  B6700  computer. 


Table  I.  Ranges  for  h(y,  g)  graphs 


FiRure 

Sl 

X 

h 

1 

0.0 

- 

1.0 

0.00 

- 

0.01 

0.01  - 

0.10 

2 

0.0 

- 

1.0 

0.00 

- 

0.01 

0.10  - 

1.00 

3 

0.0 

- 

1.0 

0.01 

- 

0.10 

0.02  - 

0.10 

4 

0.0 

- 

1.0 

0.01 

- 

0.10 

0.10  - 

1.00 

5 

0.0 

1.0 

0.10 

- 

1.00 

0.20  - 

1.00 
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Table  II.  Ranges  for  LX(V»  p)  graphs 


Figure 

ft 

V 

L 

X 

6 

0.0 

- 

1.0 

0.00  -  0.01 

.001 

-  .007 

7 

0.0 

- 

1.0 

0.01  -  0.10 

.01 

-  .08 

8 

0.0 

- 

1.0 

0.00  -  1.00 

.01 

-  .10 

9 

0.0 

- 

1.0 

0.20  -  1.00 

.10 

-  .80 

In  order  to  use  these  charts,  the  parameters  $  and  P  must  be  known.  Since 
there  is  vast  literature  concerning  this  problem,  we  only  make  a  few  brief 
comments.  There  are  at  least  three  well-known  methods  to  estimate  para¬ 
meters:  maximum  likelihood,  graphical  and  linear.  Maximum  likelihood  me¬ 

thods  contain  some  computational  and  theoretical  difficulties.  The  likeli¬ 
hood  equations  cannot  be  solved  explicitly,  and  hence  numerical  algorithms 
must  be  used.  For  the  two-parameter  Weibull  distribution,  the  order  sta¬ 
tistics  are  sufficient  but  not  complete.  Hence,  small-sample  optimality 
properties  of  the  maximum- like 11 hood  estimators  can  not  be  directly  appealed 
to.  Instead,  asymptotic  results  must  be  used. 

Graphical  techniques  are  based  on  a  graphical  estimation  of  the  cumula¬ 
tive  distribution  function  based  on  the  order  statistics.  The  procedure 
consists  of  plotting  failure  times  versus  percent  of  sample  that  has  failed 
on  Weibull  probability  paper.  Theoretically  this  data  should  lie  on  a 
straight  line.  Due  to  sampling  errors,  there  will  be  some  deviations  from 
the  theoretical  straight  line,  and  so  a  least  square  line  is  visually  fit  to 
the  data  points.  The  slope  of  this  line  provides  an  estimate  of  0,  and  the 
time  point  corresponding  to  63%  failures  provides  an  estimate  of  $.  This 
graphical  technique  has  been  programmed  on  an  APPLE  computer  graphics  system 
by  the  authors.  This  provides  one  the  advantage  of  being  able  to  visually 

notice  the  degree  of  fit  of  the  data  to  the  model.  In  addition,  an  exact 
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least  squares  line  can  be  easily  fit  to  the  data.  In  order  to  use  this  pro¬ 
gram,  the  test  engineer  only  needs  to  input  the  times  to  failure.  The  rest 
is  done  by  the  computer,  including  the  estimates  of  the  parameter.  An  exam¬ 
ple  of  this  graphical  technique  on  the  APPLE  computer  system  is  provided  in 
Figure  1  .  The  drawback  of  this  method  is  that  it  assumes  testing  until  all 
items  fail,  or  at  least  a  large  parentage.  Under  Type  I  or  II  censoring, 
this  procedure  has  to  be  modified. 

Linear  techniques  are  based  on  a  few  ordered  statistics  and  are  designed 
to  provide  best  linear  estimates  under  certain  assumptions.  They  are  easy  to 
use  and  are  adaptable  to  computer  use  by  the  maintaining  of  weights  for  the 
order  statistics  in  a  computer  data  base. 

Full  descriptions  of  these  and  other  parameter  estimation  techniques 
are  contained  in  [  3  ] . 
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Testing  Procedure 

The  testing  procedure  suggested  by  the  foregoing  analysis  is  as  follows: 

(a)  Determine  the  initial  burn-in  time,  t^. 

(b)  Based  on  the  number  of  failures  observed  to  time  t^,  estimate  § 
and  p.  Calculate  y  »  tg/6. 

(c)  Locate  the  values  of  y  and  P  on  a  chart  and  determine  L  and  h. 

x 

is  the  expected  proportional  extra  time  that  a  failure-free 
test  would  run  and  h  is  the  proportion  of  those  systems  still 
operating  which  will  be  expected  to  fail  during  that  time. 

(d)  On  the  basis  of  and  h,  make  a  decision  to  either 

(i)  Accept  those  components  that  are  still  operating,  and  re¬ 
test  the  failed  components  in  a  failure-free  mode.  This 
decision  might  be  taken  if  h  were  very  low. 

(ii)  Extend  the  test  time  on  all  systems  to  the  new  fixed  time, 
tQ(l  +  L^).  At  that  time,  accept  those  components  still 
operating.  This  decision  may  be  taken  if  the  value  of  h 
is  high,  since  by  extending  the  test  time  there  would  be 

a  high  probability  of  finding  more  defectives. 

Numerical  Examples 

(a)  A  set  of  n  *100  avionic  systems  is  burned-in  for  an  initial  period 
of  tg  *  10  hours.  At  the  end  of  this  time  period,  15  systems  have 
failed  and  85  are  still  operating.  Based  on  the  failure  times  of 
these  components,  estimates  of  p  and  6  are  obtained  as 

P  -  0.56 

A 

6  -  320 

An  estimate  of  y  is  then 

*  -  t Q/l  -  .031 
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From  Figure  A-3,  *  ,15  while  from  Figure  3-2,  h  =  ,85.  Hence, 

(i)  The  expected  length  of  a  failure-free  test  is  an  additional 
1.5  hours  (10  x  .15)  per  component. 

(ii)  If  1.5  hours  of  extra  burn-in  time  are  given  equally  to  each 
of  the  systems  still  working,  we  would  expect  one  additional 
new  failure  (85  x  .01  «  .85,  rounded  off  to  one).  If  this 
fixed- time  procedure  were  done  with  replacement,  then  (using 
Equation  27),  we  would  expect  one  repeat  failure  (.73,  rounded 
off  to  cne) . 

(iii)  If  the  failure-free  method  were  followed,  we  would  expect 
(using  Equation  26)  an  additional  two  repeat  failures  (2.05, 
rounded  off  to  two) . 

(b)  Under  the  same  situation  as  above,  the  estimates  of  p  and  $  are: 

8  -  .32 
6  •  66.7 

An  estimate  of  y  in  this  case  Is 

A  A 

Y  “  tg/6  “  -15 

From  Figure  A-4,  the  value  of  is  given  as  .7,  while  from  Figure  B-3, 

the  value  of  h  is  given  as  .09.  In  this  case, 

(i)  The  expected  length  of  a  failure-free  test  is  an  additional 
7  hour 8  (10  x  .7)  per  component. 

(ii)  If  7  hours  of  extra  burn-in  time  are  given  equally  to  each  of 
the  systems  still  working,  we  would  expect  eight  additional 
new  failures  (85  x  .09,  rounded  off  to  eight).  If  this  fixed¬ 
time  procedure  were  done  with  replacement,  then  (using  Equation 
27)  we  would  expect  6  repeat  failures  (5.8,  rounded  off  to  six). 

(iii)  If  the  failure-free  method  were  followed,  we  would  expect  (using 
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Equation  26)  an  additional  30  repeat  failures  (30.44,  rounded 
off  to  30). 

Since  9%  of  the  working  systems  would  thus  fail  in  the  next  7  hours, 
the  decision  may  be  made  to  extend  the  test  an  additional  7  hours  and 
accept  those  systems  still  operating  at  the  end  of  that  time. 
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9.  Conclusion  and  Recommendations 

Based  on  these  results ,  a  decision  on  the  use  of  a  failure-free  test 
should  depend  on  the  values  of  the  parameters  of  the  life-length  distribu¬ 
tion.  Under  certain  conditions  a  failure-free  test  is  desirable,  while  under 
other  conditions  it  makes  more  sense  to  extend  the  test  length  evenly  among 
all  components.  This  report  contains  charts  which  provide  the  test  engineer 
with  the  information  necessary  to  make  this  decision. 

An  assumption  underlying  this  report  involves  Independence  and  identical 
distribution  (iid)  of  the  pre  and  post-repair  lifetimes.  On  the  other  hand, 
this  model  allows  for  aging  of  components  through  the  Weibull  failure  dis¬ 
tribution.  Models  which  assume  an  exponential  failure  distribution  automa¬ 
tically  imply  iid  pre  and  post-repair  lifetimes  due  to  the  memoryless  property 
of  the  exponential.  Therefore,  the  Weibull  assumption  is  a  closer  approxima¬ 
tion  than  the  exponential  assumption.  A  possible  extension  of  this  research 
would  be  a  relaxation  of  the  iid  assumption  resulting  in  a  better  approxima- 

i 

tion  to  an  actual  reliability  test. 
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Conditional  Failure  Probability  Charts 
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